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Abstract 

It is well known that, under broad assumptions, the time-scaled point process 
of exceedances of a high level by a stationary sequence converges to a compound 
Poisson process as the level grows. The purpose of this note is to demonstrate 
that, for any given distribution G on N, there exists a stationary sequence for 
which the compounding law of this limiting process of exceedances will coincide 
with G. 
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Let {Xfc}fc>o be a stationary discrete time real-valued process. For a suitably chosen 
increasing real sequence {«„}, consider the time-scaled point process of exceedances 

N^{A):=i^{k/neA:Xk>Un}, AeB{R+). (1) 

As is well known (see e.g. Corollary 3.3 and Theorem 4.1 in [3] and also [7]), under 
broad assumptions, if the process iV„ has a limit as n — )■ oo, the latter must be a 
compound Poisson process, with some compounding law G = {gk}k>i on N. An 
example of an important class of processes for which G is non-trivial can be found e.g. 
in [2] (Scenario 4.3): in the case of asymptotically homogeneous Markov chains, G will 
be geometric. 

The main objective of this short note is to complete the characterization of the class 
of limiting distributions for ([T]) by giving an affirmative answer to the following natural 
question: Given an arbitrary distribution G on N, does there exist a stationary process 
{X,} for which G will be the cluster size distribution for the limiting point process of 
exceedances? This is achieved by constructing a two-dimensional stationary Markov 
chain (essentially, a regenerative process), then taking {X,} to be the component 
process of the chain and applying to it results from [7]. While in the case of a finite 
mean 

/i:=EC, C~G, 
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this task is next to trivial, the case /i = cxd is more interesting and is, in our opinion, 
worth presenting. 



Theorem 1 For any distribution G on N there exists a stationary process {X,} for 
which the time-scaled point process of exceedances ([1]) converges, for a suitably chosen 
{un 'I}, to a compound Poisson process with compounding law G. 

Proof. Without loss of generality, we can assume that 

g.c.d.{A; >l: gk = l} = l. (2) 

In the case when fi < oo, the construction is straightforward: it will be a regenerative 
process staying at randomly chosen levels during regeneration cycles of random lengths 
distributed according to G, the heights of the levels and the lengths of the cycles forming 
independent i.i.d. sequences. Then exceedances of a high level will automatically be 
clustered, with cluster size distributed almost as ( since it is quite unlikely to have two 
cycles with large heights one after another. 

More formally, let {Yk}k>i be an i.i.d. sequence of random variables that are, say, 
exponentially distributed with mean one, and let {Tk}k>i be a sequence of independent 
random variables which is independent of {Y,} and such that 

r„ = C, n>2- Fin= j) = -¥{(> j) = -y^g,^, j>l. (3) 

Putting Sk := we see that 

T]{n) := mm{k > 1 : Sk > n} 
is a (delayed) stationary renewal process, with a linear renewal function: 

oo 

F(S, = k) = -— = -, k>l (4) 

(see e.g. Section 9.2 in [Ij). Therefore the process 

Xk := Y^^k), k>0, (5) 

will also be stationary: {Xk}k>Q = {Xi+k}k>o for any / > 1, which immediately follows 
from the independence of the sequences {r,} and {Y,} and the well-known fact that, 

for any / > 1, one has S'^(/) — I = Ti. 

The process {X,} is clearly regenerative in the sense of [7], with i.i.d. cycles Gk : = 
{Xs,,_-^+j : < j < Sk — Sk-i}, > 1. It is obvious that, for the number of exceedances 
of {X,} of the level during the cycle Gk defined as 

6 := #{j : Sk-i < J < Sk, Xj > Un}, 

one has 

> ^ f Tk, if Yk > Un, /gx 

[0, otherwise, ^ ' 
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and so 

mk=j\^k>0) = g„ k>2. (7) 

Now it follows immediately from Theorem 3.3 in [7J that G will be the asymptotic 
distribution of the exceedances cluster size. 

Now turn to the case n = oo. The simple construction presented above won't work 
in this the regeneration cycles will have infinite mean lengths, but it can be 

modified by making the components of the random vectors (rfc, Yk), k > 1, dependent 
of each other in such a way that (i) the conditional distribution of given = y 
converges to G as ?/ — )■ oo (as we are interested in exceedances of high levels after all, we 
need to control the cycle length law only when there is an exceedance inside the cycle) 
and (ii) Er^ < oo. Observe that a different regenerative process with the cycle length 
distribution depending on the level height was used in |8] to give a counterexample 
concerning the interpretation of the extremal index, see Remark 2 below. 

To construct our version of the modified regenerative process, consider an i.i.d. 

sequence {(Cfc, ^fc)}A:>2 of random vectors with independent components, Cfe = 
being exponential random variables with unit mean, and let := Cfc A \Yk], k > 2, 
where, as usual, [x] := min{A; G N : k > x}. 

Thus defined i.i.d. random variables will have the distribution 



POD 

:=P(r2=j)= / 9jiv)e-''dv, j > 1, 
Jo 



where 

f 9j, j < M, 
9j{v) ■■= F(C A \v] =j)=< g^^-^ , J = \v] , g^:=Y^ 



0, j > \v] , i>m 



with the mean 



/•oo POO 

u:=Et2= / E{C A\v])e-''dv < / \v]e-''dv < oo. (8) 
Jo Jo 

Observe that, for t> > 

p(r2 edv\T2= j) = ^ ' P(F2 edv) = ^-^ e~^dv. (9) 

P(^2 =J) Pj 

Now we have to define (ri, Yi) (assumed to be independent of {(r^, Yk)}k>2) in such 
a way that the process (j5]) will again be stationary. It is obvious that, similarly to ([3]), 
Ti should follow the distribution 

nn=j) = -¥{T2>j) = -y2prn, J>1, (10) 

m>j 

SO we only need to specify how Yi depends on ti. Again, it's quite clear that the 
dependence should be the same as one has in the limit (as k — )■ oo) between Y^(k^ and 
the overshot Srf{k) — k. As the length Tj^{k) of the renewal interval 'covering' the point k 
is, loosely speaking, greater than that of a 'typical' r, our construction implies that 
F^(fc) should also be greater than a 'typical' Y (and, in particular, cannot have the 
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same distribution as Y2, cf. ( |T5l) ). The above informal argument leads to the following 
construction. 
Denote by 

-f{k) := k - Sr,(k)-i, xik) := Sr,[k) - k 

the defect and excess of the level k in the random walk {S*,}, respectively. As is well 
known (recall ([2]) and see e.g. Section 9.3 in [T]), 

limP(7(fc) = z, x(fc)=j) = ^, ^>0, J>1. (11) 

fe— >-oo h> 

Now consider a random vector (7, x, V) assuming that its first two components are 
integer- valued and such that P(7 = i, x = j) is given by the RHS of f lTT]) (note that 
the distribution of x coincide with that of ri from ffTOj) ). whereas 

¥{y edv\-f = i,x = j)= P(>^2 edv\T2 = i + j), i>0, J > 1. 
Finally, we set 

(ri,ro = (x,v^) 

and again consider the process {X,} defined by ([5]). 

It is obvious from the construction that, to prove that {X,} is stationary, it suffices 
to show that, for any / > 1, j > 1, m > 1 and f > 0, one has 

P(F^(0 e dv, Sr,(^i) -l = j) = P(ri e dv, n = j). (12) 

To do that, we first observe that 

F{Yi e dv, n = j) = F{V edv,x = j) 



J2f{v edv\^ = z,x = j)ni = hx = j) 

i>0 

yF{Y2edv\r2 = i+j)^ 



i>0 

e-^dv 



V 

i>0 



Y.3iA^^ (13) 



from (p. Now, for the LHS of ([l2]), we have 

P(r,(0 e dv, S^i^i) -i = ]) = P(n edv,n = i + j) 

i—i 00 

+ ^(^^+1 edv,Sr. = l- I, Sr+1 = l+j] 



j=0 r=l 
e~'"dv yr-^ 

i>0 
l—l 00 



+ 5Z 5^ P(^r = ^ - i) nYr+i G dv, r,+i = i+j), (14) 



1=0 r=l 
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where we used ( fT3ll to evaluate the first term on the RHS of the first hne. The inner 
sum in the last line of ( fT^ . using the first relation in and ([9]), is equal to 



P(r2 edv\T2 = i + ]] 



r=l 



Era 



e ^dv 



V 



■9i+j[v) 



Substituting these expressions into the RHS of (ITll) yields 



f(Yn(i) e dv, S^(i) - I = j 



e '"dv 



i-i 



i>0 



1=0 



e ^dv 



i>0 



which coincides with the RHS of (fT3|) thus proving the desired stationarity of {X,}. 

It remains to observe that our process {X,} is again regenerative, with the finite 
mean cycle length u, that still holds for it, and that, instead of ([7]), we now have 

sup \F{^k = j I >0)-gj\< g^^^^ . 

Clearly, g^u„] — )■ as n — )■ cxo, and so condition (3.9) of Theorem 3.3 in [7] will be 
satisfied. Therefore the claim of the theorem will hold in this well, showing 

that G = {gj} will emerge as the asymptotic distribution for the size of clusters of 
exceedances. Theorem 1 is proved. 

□ 

Remark 1 Observe that it follows from ffT3l) and (18]) that 



, ^ e ^dv sr^sr^ , n 



¥{¥2 E dv) 

E^2 



J2nr2>j\Y2 = v). (15) 



As, due to stationarity = Xi = Yi for any A; > 1, the above implies that the 
distribution of Xk in our stationary process is given by 



P(Xfc e dv) = P(Fi e dv) 



E (ra \Y2 = v) 
Et2 



F(Y2 edv), v>0. 



Remark 2 Recall that, under broad assumptions, the limiting distributional type 
for the maxima in a stationary sequence coincides with that for the maxima of i.i.d. 
random variables with the same marginal distribution, and that the changes brought 
by dependence can often be characterized by the so-called "extremal index" of the 
stationary sequence (see e.g. Section 2 in |1] and further references therein, and also [6]). 
For a sequence {x,} C M, set M„(x,) := maxi<fc<„ Xk, and let {X,} be an i.i.d. sequence 

with Xk = Xk- As it is well known (see e.g. Lemma 1.2.2 in [1]), for a sequence of 
constants {«,} and A G [0, 00], one has nP(Xo > «„) — )■ A as n — )■ 00 iff 

P(M„(X.) < Un) ^ e-\ (16) 
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It turns out that, in many cases (originally it was noted for strongly mixing sequences 
in [5]), if ( IT6l) holds then one also has 

P(M„(X.) < Un) ^ e-^^ 

for a some fixed value 9 G [0, 1] which is referred to as the extremal index of the 
sequence {X,}. 

Alternatively, the extremal index can be characterized by the fact that 1/^ is the 
limiting mean cluster size in the sequence of point processes ([1]) of exceedances over 
high levels, as it was shown under broad assumptions in |3j; a counterexample showing 
that this interpretation of 9 is not necessarily correct was given in [S] . For the stationary 
sequences that we constructed in the proof of Theorem 1, the extremal index is equal 
to 6' = 1/EC in both cases (^^ = when EC = oo), which can easily be verified by a 
direct calculation. 
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